
SEQUENCE LISTING ^ 

\ : t 

(1) GENE3AL INFORMATION: 

(i) APPLICANT: Quertermous, Thomas 

Hogan, 3rigid 
Snodgrass, H. Ralph 
Zupancic, Thomas J. 

(ii) TITLE OF INVENTION : DE VELO PMENTALL Y - REGULATED ENDOTHELIAL 
CELL LOCUS -1 

(iii) NUM3ER OF SEQUENCES: 2 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSES: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 10035-2711 

(v) COMPUTER READA3LE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: I3M PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Pa ten tin Release =1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To 3e Assigned 
(3) FILING DATE: 05-JUN-199S 
(C) CLASSIFICATION: " 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Poissant, Brian M. 

(3) REGISTRATION NUM3ER: 23,462 

(C) REFERENCE /DOCKET NUMBER: 8 907-034 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (212) 790-9090 
(3) TELEFAX: (212) 869-8854/9741 
(C) TELEX: 66141 Pennie 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie He Thr Gin Gly Xaa 
1 5 io" 15' 
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Xaa Ala Lys Asp Phe Gly Asp Val Leu Phe Val Giv Ser Tvrt^vs Leu 

20 25 * ■ 30 ,1 ' 

Ala Tyr Ser Asn Asp Gly Glu His Tro Met Val His Gin Aso Glu Lys 
35 40 45 

Gin Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Aso Asn Aso Thr His 
50 55 S0~ 

Arg Lys Asn Val lie Asp Pro Pro He Tyr Ala Arg Phe He Arg He 
65 70 75 80 

Leu Pro Leu Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Asp Leu Gly Ser Ser Lys Glu Val Thr Gly He lie Thr Gin Gly Xaa 
1 5 10 is 

Xaa Ala Arg Asn Phe Gly Ser Val Gin Phe Val Ala Ser Tyr Lys Val 

20 25 30 

Ala Tyr Ser Asn Asp Ser Ala Asn Trp Thr Glu Tvr Gin Asd Pro Arg 
35 40 45 

Thr Gly Ser Ser Lys Val Phe Gin Gly Asn Leu Aso Asn Asn Ser His 
50 55 SO* 

Lys Lys Asn He Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val 
65 70 75 80 

Leu Pro Val Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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Asp Leu Leu Lys He Lys Lys He Thr Ala He He Thr Gin £Iy Xaa 

Xaa -Cys Lys Ser Leu Ser Ser Glu Met Tyr Val Lvs Ser Tvr Thr He 

20 25 - 30 

His Tyr Ser Glu Gin Gly Val Glu Trp Lys Pro Tyr Arg Leu Lys Se- 
35 40 45 

Ser Met Val Asp Lys He Phe Glu Gly Asn Thr Asn Thr Lys Gly His 
50 55 60 

Val Lys Asn Phe Phe Asn Pro Pro He He Ser Arg Phe He Arg Val 
65 70 75 80 " 

He Pro Lys Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(8) TYPE : amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: oeotide 



(xi) SEQUENCE DESCRI 

Asp Leu Gin Lys Thr 
1 5 

Xaa Val Lys Ser Leu 

20 

Ser Ser Ser Gin Asd 
35 

Asn Gly Lvs Val Lys 
50 

Met Met Asn Ser Leu 
55 

Xaa Xaa Xaa Xaa Xaa 

85 



TION: SEQ ID NO:4: 

Met Lys Val Thr Gly He 

10 

Phe Thr Ser Met Phe Val 

25 



Gly His His Trn Thr Xaa 
40 



Val Phe Gin Gly Asn Gin 
55 



Asp Pro Pro Leu Leu Thr 
70 75 



He Thr Gin Gly Xaa 

15 

Lys Glu Phe Leu He 
30 

Xaa Gin He Leu Tyr 
45 

Aso Ser Ser Thr Pro 
60*" 

Arg Xaa Xaa Xaa Xaa 

80 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : ^ : .T 

Asp Leu Glu Asn Leu Arg Phe Val Ser Gly lie Glv Thr Gin Gly Ala 
1 5 10 15 

He Ser Lys Glu Thr Lys Lys Lys Tyr Phe Val Lvs Ser Tyr Lys Val 

20 25 36 

Asp He Ser Ser Asn Gly Glu Asp Trp He Xaa Xaa Thr Leu Lvs Glv 
35 40 45 

Asp Asn Lys His Leu Val Phe Thr Glv Asn Thr Asp Ala Thr Asd Val 
50 55 60 

Val Tyr Arg Pro Phe Ser Lys Pro Val He Thr Arg Phe Val Arq Leu 
65 70 75 so 

Arg Pro Val Thr Tro 

85' 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 6 : 

Asp Leu Ala Glu Glu Lys He Val Arg Glv Val He He Gin Glv Xaa 
1 5 io 15 " 

Xaa Gly Lys His Lys Glu Asn Lys Val Phe Met Arg Lvs ^he Lvs He 

20 25 30 

Gly Tyr Ser Asn Asn Gly Thr Glu Trp Glu Met He Met Asd Ser Ser 
35 40 45 

Lys Asn Lys Pro Lys Thr Phe Glu Gly Asn Thr Asn Tyr Asd Th- Pro 
=>° 55 60 

Glu Leu Arg Thr Phe Xaa Ala His He Thr Thr Glv Phe He A-o He 
6:3 70 75 * ~ y 80 

He Pro Xaa Xaa Xaa 

85 

(2) INFORMATION F0 R SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : Deotide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Cys Glu Val Pro Arg Thr Phe Met Cvs Val Ala Leu Gin Glv Xaa 
15 10 15* 

Xaa Xaa Arg Gly Xaa Asp Ala Asp Gin Tro Val Thr Ser Tyr Lys lie 

20 ~ " 25 " 30 

Arg Tyr Ser Leu Asp Asn Val Ser Trp Phe Xaa Xaa Xaa Xaa Xaa Glu 
35 40 " 45 

Tyr Arg Asn Gly Ala Ala lie Thr Gly Val Thr Aso Arg Asn thr Val 
50 55 50~ 

Val Asn His Phe Phe Asp Thr Pro lie Arg Ala Arg Ser lie Ala He 
65 70 75 80 

His Pro Leu Thr Xaa 

85 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Asp Leu Xaa Xaa Xaa Xaa Xaa Val Thr Gly He He Thr Gin Gly Xaa 
15 10 15* 

Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa Phe Val Xaa Ser Tvr Lys He 

20 25 30 

Xaa Tyr Ser Xaa Asp Gly Xaa Xaa Tro Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Lys Xaa Lys Val Phe Xaa Gly Asn Thr Aso Xaa Xaa Thr Xaa 
50 55 50~ 

Xaa Xaa Asn Xaa Phe Xaa Xaa Pro lie Xaa Xaa Arg Phe lie Arg Xaa 
65 70 75 ' 80 

Xaa Pro Xaa Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 9 : 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2303 base pairs 
(3) TYPE : nucleic acid"" 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA ^ 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 619.. 2058 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAATTCCGGT TAACTGAGGA CAAAGGGTAA TGCAGAAGTG ATATTTGATT TCCATTCTCA 60 

TTCCCAGTGG CCTTGATATT TAAACTGATT CCTGCCACCA GGTCCTTGGG CCACCCTGTC 120 

CCTGCGTCTC ATATTTCTGC ATGCTGCTTT GTTTGTATAT AGTGCGCTCC TGGCCTCAGG 180 

CTCGCTCCCC TCCAGCTCTC GCTTCATTGT TCTCCAAGTC AGAAGCCCCC GCATCCGCCG 240 

CGCAGCAGCG TGAGCCGTAG TCACTGCTGG CCGCTTCGCC TGCGTGCGCG CACGGAAATC 300 

GGGGAGCCAG GAACCCAAGG AGCCGCCGTC CGCCCGCTGT GCCTCTGCTA GACCACTCGC 360 

AGCCCCAGCC TCTCTCAAGC GCACCCACCT CCGCGCACCC CAGCTCAGGC GAAGCTGGAG 420 

TGAGGGTGAA TCACCCTTTC TCTAGGGCCA CCACTCTTTT ATCGCCCTTC CCAAGATTTG 480 

AGAAGCGCTG CGGGAGGAAA GACGTCCTCT TGATCTCTGA CAGGGCGGGG TTTACTGCTG 5 540 

TCCTGCAGGC GCGCCTCGCC TACTGTGCCC TCCGCTACGA CCCCGGACCA GCCCAGGTCA 600 

CGTCCGTGAG AAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG GTT 651 

Met Lys His Leu Val Ala Ala Trr> Leu Leu Val 
1 5 ~ 10 

GGA CTC AGC CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC 599 
Gly Leu Ser Leu Gly Val Pro Gin Phe Gly Lys Gly Aso lie Cys Asn 

15 20 - ' * 25 

CCG AAC CCC TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT 74 7 

Pro Asn Pro Cys Glu Asn Gly Gly lie Cys Leu Ser Gly Leu Ala Aso 
30 3 5 40 

GAT TCC TTT TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG AAC TGC 795 
Asp Ser Phe Ser Cys Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys 
45 50 55 

TCT AGT GTT GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TCA GCA 843 
Ser Ser Val Val Glu Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala 
60 65 70 75 

GGT CCC TGC ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TGT GAG ATA 891 
Gly Pro Cys lie Pro Asn Pro Cys His Asn Glv Glv Thr Cys Glu lie 

80 85 ' * 90 

AGC GAA GCC TAT CGA GGA GAC AC A TTC ATA GGC TAT GTT TGT AAA TGT 939 
Ser Glu Ala Tyr Arg Gly Asp Thr Phe lie Gly Tvr Val Cvs Lys Cys 

95 100 ' * 105- 

CCT CGG GGA TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA TGT 987 
Pro Arg Gly Phe Asn Gly lie His Cys Gin His Asn He Asn Glu Cys 
HO 115 120 

GAA GCT GAG CCT TGC AGA AAT GGC GGA ATA TGT ACC GAC CTT GTT GCT 1035 
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Glu Ala Glu Pro Cys Arg Asn Gly Gly He Cvs Thr Aso Leu Val A*Sl 
125 130 \ 135 " A 

AAC TAC TCT TGT GAA TGC CCA GGA GAA TTT ATG GGA CGA AAT TGT CAA 1083 
Asn Tyr Ser Cys Glu Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin 
140 145 150 155 

TAT AAA TGC TCT GGG CAC TTG GGA ATC GAA GGT GGG ATC ATA TCT AAT 1131 
Tyr Lys Cys Ser Gly His Leu Gly He Glu Gly Gly He He Ser Asn 

160 165 170 

CAG CAA ATC ACA GCT TCA TCT AAT CAC CGA GCT CTT TTT GGA CTC CAG 1179 
Gin Gin He Thr Ala Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gin 

175 180 * 185 

AAG TGG TAT CCC TAC TAT GCT CGA CTT AAT AAG AAG GGC CTT ATA AAT 1227 
Lys Trp Tyr Pro Tyr Tyr Ala Arg Leu Asn Lvs Lys Gly Leu He Asn 
190 195 200 

GCC TGG' ACA GCT GCT GAA AAT GAC AGA TGG CCA TGG ATT CAG ATA AAT 127 5 

Ala Trp Thr Ala Ala Glu Asn Asn Arg Tro Pro Trp He Gin He Asn 
205 210 * 215 

TTG CAA AGA AAA ATG AGA GTC ACT GGT GTT ATT ACC CAA GGA GCA AAA 132 3 

Leu Gin Arg Lys Met Arg Val Thr Glv Val lie Thr Gin Gly Ala Lys 
220 225 230 235 

AGG ATT GGA AGC CCA GAG TAC ATA AAA TCC TAC AAA ATT GCC TAC AGC 1371 
Arg He Gly Ser Pro Glu Tyr He Lys Ser Tyr Lys He Ala Tyr Ser 

240 245 * - 250 

AAT GAC GGG AAG ACC TGG GCA ATG TAC AAA GTA AAA GGC ACC AAT GAA 1419 
Asn Asp Gly Lys Thr Trp Ala Met Tvr Lys Val Lys Gly Thr Asn Glu 

255 260 265 

GAG ATG GTC TTT CGT GGA AAT GTT GAT AAC AAC ACA CCA TAT GCT AAT 1467 
Glu Met Val Phe Arg Gly Asn Val Aso Asn Asn Thr Pro Tyr Ala Asn 
270 275 280 

TCT TTC ACA CCC CCA ATC AAA GCT CAG TAT GTA AGA CTC TAC CCC CAA 1515 
Ser Phe Thr Pro Pro He Lys Ala Gin Tyr Val Arg Leu Tyr D ro Gin 
285 290 295 

ATT TGT CGA AGG CAT TGT ACT TTA AGA ATG GAA CTT CTT GGC TGT GAG 1563 
He Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu 
300 305 310 " 315 

CTC TCA GGC TGT TCA GAA CCT TTG GGG ATG AAA TCA GGG CAT ATA CAA 1511 
Leu Ser Gly Cys Ser Glu Pro Leu Glv Met Lvs Ser Gly His He Gin 

320 325 " 330 

GAC TAC CAG ATC ACT GCC TCC AGC GTC TTC AGA ACA CTC AAC ATG GAC 1559 
Asp Tyr Gin He Thr Ala Ser Ser Val Phe Arg Thr Leu Asn Met Aso 

335 340 * 345 

ATG TTT ACT TGG GAA CCA AGG AAA GCC AGG CTG GAC AAG CAA GGC AAA 1707 
Met Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Aso Lvs Gin Glv Lvs 
350 355 '350 

GTA AAT GCC TGG ACT TCC GGC CAT AAC GAC CAG TCA CAA TGG TTA CAG 175 5 

Val Asn Ala Trp Thr Ser Gly His Asn Asp Gin Ser Gin Tro Leu C^n 
355 370 ~ 375 
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GTT GAT CTT CTT GTC CCT ACT AAG GTG AC A GGC ATC ATT AC A CAA 35 A 1303 
Val Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie lie Thr Glrv Gly 
3 80 385 390 395 

GCT AAA GAT TTT GGT CAC GTG CAG TTT GTT GGG TCA TAC AAA CTA GCT 1851 
Ala Lys Asp Phe Gly His Val Gin Phe Val Glv Ser Tyr Lvs Leu Ala 

400 405 ' " 410 

TAC AGC AAT GAT GGA GAA CAC TGG ATG GTG CAC CAG GAT GAA AAA CAG 1899 
Tyr Ser Asn Asp Gly Glu His Trp Met Val His Gin Asp Glu Lys Gin 

415 420 425 

AGG AAA GAC AAG GTT TTT CAA GGC AAT TTT GAC AAT GAC ACT CAC AGG 1947 
Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg 
430 435 ~ 440 

AAA AAT GTC ATC GAC CCT CCC ATC TAT GCA CGA TTC ATA AG A ATC CTT 1995 
Lys Asn Val He Asp Pro Pro He Tyr Ala Arg Phe He Arg He Leu 
445 450 455 

CCT TGG TCC TGG TAT GGA AGG ATC ACT CTG CGG TCA GAG CTG CTG GGC 2043 
Pro Trp Ser Trp Tyr Gly Arg He Thr Leu Arg Ser Glu Leu Leu Gly 
460 465 470 475 

TGC GCA GAG GAG GAA TGAAGTGCGG GGCCGCACAT CCCACAATGC TTTTCTTTAT 209 8 

Cys Ala Glu Glu Glu 

480 

TTTCCTATAA GTATCTCCAC GAAATGAACT GTGTGAAGCT GATGGAAACT GCATTTGTTT 2158 
TTTTCAAAGT GTTCAAATTA TGGTAGGCTA CTGACTGTCT TTTTAGGAGT TCTAAGCTTG 2218 
CCTTTTTAAT AATTTAATTT GGTTTCCTTT GCTCAACTCT CTTATGTAAT ATCACACTGT 2278 
CTGTGAGTTA CTCTTCTTGT TCTCT 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Drotein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys His Leu Val Ala Ala Trt> Leu Leu Val Glv Leu Se- Leu Gly 
1 5 10 * 15 

Val Pro Gin Phe Gly Lys Gly Asp He Cys Asn Pro Asn Pro Cvs Glu 

20 25 30 * 

Asn Gly Gly He Cys Leu Ser Gly Leu Ala Aso Aso Ser Phe Ser Cvs 
3 5 40 ~ 45 

Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val Val Glu 
30 55 50 

Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Glv Pro Cvs ti s d*- q 
65 ™ 75 ' " ~80 



2303 
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Asn Pro Cys His Asn Gly Gly Thr Cys Glu lie Ser Glu Ala Tyr Aff 

85 90 l . 95 

Gly Asp Thr Phe lie Gly Tyr Val Cys Lys Cvs Pro Arg Gly Phe Asn 

100 105 * no 

Gly He His Cys Gin His Asn He Asn Glu Cys Glu Ala Glu *>ro Cys 
115 120 125 

Arg Asn Gly Gly He Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu 
130 135 140 

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cvs Ser Gly 
145 150 155 ^ "* 160 

His Leu Gly He Glu Gly Gly He He Ser Asn Gin Gin He Thr Ala 

165 170 175 

Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gin Lys Trt> Tyr Pro Tyr 

180 185 " 190 

Tyr Ala Arg Leu Asn Lys Lys Gly Leu He Asn Ala Tro Thr Ala Ala 
195 200 205 

Glu Asn Asp Arg Trp Pro Trp He Gin He Asn Leu Gin Arg Lys Met 
210 215 220 

Arg Val Thr Gly Val He Thr Gin Gly Ala Lvs Arg He Gly Ser Pro 
22 5 230 23 5 240 

Glu Tyr He Lys Ser Tyr Lys He Ala Tyr Ser Asn Aso Glv Lvs Thr 

245 250 ' 255 

Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Glu Met Val °he A-g 

260 265 270 

Gly Asn Val Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro 
275 280 285 

He Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin lie Cvs Arg Arg His 
290 295 300 

Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Glv Cvs Ser 
305 310 315 * ~ 320 

Glu Pro Leu Gly Met Lys Ser Gly His He Gin Aso Tyr Gin He Thr 

325 330 " 335 

Ala Ser Ser Val Phe Arg Thr Leu Asn Met Aso Met Phe Thr Tro G^u 

340 345 350 

Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys Val Asn Ala Tro Thr 
355 360 355 

Ser Gly His Asn Asp Gin Ser Gin Tro Leu Gin Val Aso Leu Leu Va^ 
370 375 380 

Pro Thr Lys Val Thr Gly He He Thr Gin Glv Ala Lvs Aso ^he Glv 
385 ■ 390 395 " " 400 

His Val Gin Phe Val Gly Ser Tyr Lvs Leu Aia Tvr Ser Asn Aso Glv 

405 410 415 
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Glu His Trp Met Val His Gin Aso Glu Lvs Gin Arg Lys Asp Lys vSl 

420 425 1 ~ 430- ,i 

Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie Asd 
435 440 * 445 

Pro Pro lie Tyr Ala Arg Phe lie Arg lie Leu Pro Trp Ser Trr> Ty 
450 455 460 

Gly Arg He Thr Leu Arg Ser Glu Leu Leu Gly Cys Ala Glu Glu Glu 
465 470 475 * 480 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 1779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCT CTT TAG TCA CCA CTC TCG CCC TCT CCA AGA ATT TGT TTA ACA AAG 
Ser Leu - Ser Pro Leu Ser Pro Ser Pro Arg He Cys Leu Thr Lys 
15 10 15 

CGC TGA GGA AAG AGA ACG TCT TCT TGA ATT CTT TAG TAG GGG CGG AGT 
Axg * Gly Lys Arg Thr Ser Ser * He Leu * * Gly A-g Se*- 

20 25 30 

CTG CTG CTG CCC TGC GCT GCC ACC TCG GCT ACA CTG CCC TCC GCG ACG 
Leu Leu Leu Pro Cys Ala Ala Thr Ser Ala Thr Leu Pro Ser Ala Thr 
35 40 45 

ACC CCT GAC CAG CCG GGG TCA CGT CCG GGA GAC GGG ATC ATG AAG CGC 
Thr Pro Asp Gin Pro Gly Ser Arg Pro Gly Asp Gly He Met Lys Arg 
50 55 60 

TCG GTA GCC GTC TGG CTC TTG GTC GGG CTC AGC CTC GGT GTC CCC CAG 
Ser Val Ala Val Trp Leu Leu Val Glv Leu Ser Leu Gly Val Pro Gin 
65 70 75 80 

TTC GGC AAA GGT GAT ATT TGT GAT CCC AAT CCA TGT GAA AAT GGA GGT 
Pne Gly Lys Gly Asp He Cys Aso Pro Asn Pro Cvs Glu Asn Glv Glv 

85 90 95 

ATC TGT TTG CCA GGA TTG GCT GTA GGT TCC TTT TCC TGT GAG TGT CCA 
He Cys Leu Pro Gly Leu Ala Val Glv Ser Phe Ser Cvs Glu- Cvs o*-o 

100 105 * HO * 

GAT GGC TTC ACA GAC CCC AAC TGT TCT AGT GTT GTG GAG GTT GCA TC* 
Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ai a 
115 120 125 
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GAT GAA GAA GAA CCA ACT TCA GCA GGT CCC TGC ACT CCT AAT CCA T^C 432 

Asp Glu Glu Glu Pro Thr Ser Ala Glv Pro OJs Thr Pro Asn-Pro .Cys 
130 135 140 

* 

CAT AAT GGA GGA ACC TGT GAA ATA AGT GAA GCA TAC CGA GGG GAT ACA 430 

His Asn Gly Gly Thr Cys Glu lie Ser Glu Ala Tyr Arg Gly Aso Thr 
145 150 155 " ' ' 160 

TTC ATA GGC TAT GTT TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC 528 

Phe lie Gly Tyr Val Cys Lys Cys Pro Arg Glv Phe Asn Gly lie His 

165 170 175 

TGT CAG CAC AAC ATA AAT GAA TGC GAA GTT GAG CCT TGC AAA AAT GGT 57 6 

Cys Gin His Asn lie Asn Glu Cys Glu Val Glu Pro Cvs Lys Asn Gly 

180 185 190 

GGA ATA TGT ACA GAT CTT GTT GCT AAC TAT TCC TGT GAG TGC CCA GGC 524 

Gly He Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu Cys Pro Gly 
195 200 " 205 

GAA TTT ATG GGA AGA AAT TGT CAA TAC AAA TGC TCA GGC CCA CTG GGA 57 2 

Glu Phe Met Gly Arg Asn Cys Gin Tvr Lys Cvs Ser Glv Pro Leu Glv 
210 215 * " 220 

ATT GAA GGT GGA ATT ATA TCA AAC CAG CAA ATC ACA GCT TCC TCT ACT 720 

He Glu Gly Gly He He Ser Asn Gin Gin He Thr Ala Ser Ser Thr 
225 230 235 240 

CAC CGA GCT CTT TTT GGA CTC CAA AAA TGG TAT CCC TAC TAT GCA CGT 768 

His Arg Ala Leu Phe Gly Leu Gin Lys Tro Tvr Pro Tvr Tyr Ala Arg 

245 250 * 255 

CTT AAT AAG AAG GGG CTT ATA AAT GCG TGG ACA GCT GCA GAA AAT GAC 816 

Leu Asn Lys Lys Gly Leu He Asn Ala Tro Thr Ala Ala Glu Asn Asr> 

260 265 270 

AGA TGG AAG CGG TGG ATT CAG ATA AAT TTG CAA AGA AAA ATG AGA GTT 364 

Arg Trp Lys Arg Trp He Gin He Asn Leu Gin Arg Lvs Met Arg Val 
275 280 235 

ACT GGT GTG ATT ACC CAA GGG GCC AAG AGG ATT GGA AGC CCA GAG TAT 912 

Thr Gly Val He Thr Gin Gly Ala Lys Arg lie Glv Ser Pro Glu Tyr 
290 295 " 300 

ATA AAA TTC TAC AAA ATT GCC TAC AGT AAT GAT GGA AAG ACT TGG GCA 960 

He Lys Phe Tyr Lys lie Ala Tyr Ser Asn Aso Gly Lvs Thr Tro Ala 
305 310 315 " "320 

ATG TAC AAA GTG AAA GGC ACC AAT GAA GAC ATG GTG TTT CGT GGA AAC 1008 

Met Tyr Lys Val Lys Gly Thr Asn Glu Aso Met Val Phe Arg Glv Asn 

325 330 335 

ATT GAT AAC AAC ACT CCA TAT GCT AAC TCT TTC ACA CCC CCC ATA AAA 105 6 

He Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro He Lvs 

340 345 350 

GCT CAG TAT GTA AGA CTC TAT CCC CAA GTT TGT CGA AGA CAT TGC ACT 1104 

Ala Gin Tyr Val Arg Leu Tyr Pro Gin Val Cys Arg Arg His Cys Thr 
355 350 355 

TTG CGA ATG GAA CTT CTT GGC TGT GAA CTG TCG GGT TGT TCT GAG CCT 1152 

Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cvs Ser Glu Pro 
370 375 * 330 
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CTG GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG ATC ACT GCC ^CC 
Leu Gly Met Lys Ser Gly His lie Gin Asp iyr Gin lie Thr Ala ^er 
385 390 ~ 395 400 
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AGC ATC TTC AGA ACG CTC AAC ATG GAC ATG TTC ACT TGG GAA CCA AGG 1248 

Ser lie Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu Pro Arg 

405 " 410 415 

AAA GCT CGG CTG GAC AAG CAA GGC AAA GTG AAT GCC TGG ACC TCT GGC 129 6 

Lys Ala Arg Leu Asp Lys Gin Gly Lys Val Asn Ala Trp Thr Ser Gly 

420 425 430 

CAC AAT GAC CAG TCA CAA TGG TTA CAG GTG GAT CTT CTT GTT CCA ACC 1344 

His Asn Asp Gin Ser Gin Trp Leu Gin Val Asp Leu Leu Val Pro Thr 

435 440 445 

AAA GTG ACT GGC ATC ATT ACA CAA GGA GCT AAA GAT TTT GGT CAT GTA 1392 

Lys Val Thr Gly lie lie Thr Gin Gly Ala Lys Asp Phe Gly His Val 

450 455 " 460 

CAG TTT GTT GGC TCC TAC AAA CTG GCT TAC AGC AAT GAT GGA GAA CAC 144 0 

Gin Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Aso Gly Glu His 

465 470 " 475 " 480 

TGG ACT GTA TAC CAG GAT GAA AAG CAA AGA AAA GAT AAG GTT TTC CAG 1488 

Trp Thr Val Tyr Gin Asp Glu Lys Gin Arg Lys Asp Lys Val Phe Gin 

485 490 495 

GGA AAT TTT GAC AAT GAC ACT CAC AGA AAA AAT GTC ATC GAC CCT CCC 1536 

Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie Aso Pro Pro 

500 505 510 

ATC TAT GCA CGA CAC ATA AGA ATC CTT CCT TGG TCC TGG TAC GGG AGG 15 84 

lie Tyr Ala Arg His lie Arg lie Leu Pro Tro Ser Tro Tyr Gly Arg 

515 520 " 525 

ATC ACA TTG GCG TCA GAG CTG CTG GGC TGC ACA GAG GAG GAA TGA GGG 163 2 

lie Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu Glu * Gly 

530 535 ~ * 540 

GAG GCT ACA TTT CAC AAC CGT CTT CCC TAT TTG GGT AAA AGT ATC TCC 1680 

Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lvs Ser lie Ser 

545 550 ' 555 ~ ' 560 

ATG GAA TGA ACT GTG TAA AAT CTG TAG GAA ACT GAA TGG TTT TTT TTT 1728 

Met Glu * Thr Val * Asn Leu * Glu Thr Glu Tro Phe Phe Phe 

565 570 ' 575 

TTT TCA TGA AAA AGT GGT CAA ATT ATG GTA GGC AAC TAA CGG TGT TTT 177 6 

Phe Ser * Lys Ser Gly Gin lie Met Val Gly Asn * Arg Cys Phe 

580 585 590 



TAC C 
Tvr 



1780 



( 2 ) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: aiaino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: orotein *t 

* - i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Leu * Ser Pro Leu Ser Pro Ser Pro Arg lie Cys Leu Thr Lys 
15 10 ~ 15 

Arg 

INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Lys Arg Thr Ser Ser 
1 5 

INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

lie Leu * * Gly Arg Ser Leu Leu Leu Pro Cvs Ala Ala Thr Ser 
1 5 10 ' 15 

Ala Thr Leu Pro Ser Ala Thr Thr Pro Aso Gin Pro Gly Se- Arg ^ro 

20 25 * 30 

Gly Asp Gly He Met Lys Arg Ser Val Ala Val Tro Leu Leu Val Glv 
35 40 45 

Leu Ser Leu Gly Val Pro Gin Phe Gly Lvs Gly Aso He Cys Aso Pro 
50 55 60~ 

Asn Pro Cys Glu Asn Gly Gly He Cys Leu Pro Glv Leu Ala Val Gly 
65 70 75 * 80 

Ser Phe Ser Cys Glu Cys Pro As? Glv Phe Thr Aso Pro Asn Cvs Se^ 

85 go 95 

Ser Val Val Glu Val Ala Ser Aso Glu Glu Glu Pro Thr Se- A^a Glv 

100 105 no 

Pro Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cvs Glu Ti e Ser 
115 120 1-25 

Glu Ala Tyr Arg Gly Asp Thr Phe He Gly Tvr Val Cvs Lys Cvs Pro 
130 135 ' 140 " 

Arg Gly Phe Asn Gly He His Cys Gin His Asn He Asn Glu Cvs Glu 
145 150 155 - 160 
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Val Glu Pro Cys Lys Asn Gly Gly He Cvs Thr Aso Leu Val*Ala Asn 

165 U70 175 

Tyr Ser Cys Glu Cys Pro Gly Glu Phe Met Glv Arg Asn Cys Gin Tv- 

180 185 190 

Lys Cys Ser Gly Pro Leu Gly He Glu Gly Gly He He Ser Asn Gin 
195 200 205 

Gin He Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gin Lvs 
210 215 220 

Trp Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lvs Glv Leu He Asn Ala 
225 230 235 240 

Trp Thr Ala Ala Glu Asn Asp Arg Trp Lys Arg Tro He Gin He Asn 

245 250 255 

Leu Gin Arg Lys Met Arg Val Thr Glv Val He Thr Gin Gly Ala Lvs 

260 255 270 

Arg He Gly Ser Pro Glu Tyr He Lvs Phe Tvr Lvs He Ala Tvr Se- 
275 280 285 

Asn Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lvs Gly Thr Asn Glu 
290 295 300 

Asp Met Val Phe Arg Gly Asn He Aso Asn Asn Thr Pro Tyr Ala Asn 
305 310 315 320 

Ser Phe Thr Pro Pro He Lys Ala Gin Tvr Val Arg Leu Tyr Pro Gin 

325 ' 330 ' 335 

.Val Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys G^u 

340 345 350 

Leu Ser Gly Cys Ser Glu Pro Leu Glv Met Lvs Ser Gly His He G^n 
355 350 ' 365 

Asp Tyr Gin He Thr Ala Ser Ser He Phe Arg Thr Leu Asn Met Aso 
370 375 380 

Met Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Aso Lys Gin Glv Lvs 
385 390 398 ' - 400 

Val Asn Ala Trp Thr Ser Gly His Asn Aso Gin Ser Gin Tro Leu Gin 

405 410 " 415 

Val Asp Leu Leu Val Pro Thr Lys Vai Thr Glv lie He Thr Gin Glv 

420 425 430 

Ala Lys Asp Phe Gly His Val Gin Phe Vai Gly Ser Tyr Lys Leu A^a 
435 440 ' 445 

Tyr Ser Asn Asp Gly Glu His Trp Thr Vai Tvr Gin Aso Giu Lys G'n 
450 455 ' 450 

Arg Lys Asp Lys Vai Phe Gin Gly Asn Phe Aso Asn Aso Thr His A-q 
4o5 470 475 " 480 

Lys Asn Val He Asp Pro Pro lie Tyr Ala Arg His He Arg He L~u 

485 490 " 495 



( 



Pro Trp Ser Trp Tyr Gly Arg He Thr Leu Ala Ser Glu Leu^Leu Glv 

500 505 1 51fl£ 

Cys Thr Glu Glu Glu 
515 

(2) INFORMATION FOR SEQ ID NO; 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser lie 
1 5 io 15 . 

Ser Met Glu * Thr Val * Asn Leu 

20 25 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Glu Thr Glu Trp Phe Phe Phe Phe Ser 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Ser Gly Gin He Met Val Gly Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 9 amino acids 

(B) . TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Drotein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Arg Cys Phe Tyr 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



GACAGATGGC 


CATGGATTCA 


GATAAATTTG 


CAAAGAAAAA 


TGAGAGTCAC 


TGGTGTTATT 


60 


ACCCAAGGAG 


CAAAAAGGAT 


TGGAAGCCCA 


GAGTACATAA 


AATCCTACAA 


AATTGCCTAC 


120 


AGCAATGACG 


GGAAGACCTG 


GGCAATGTAC 


AAAGTAAAAG 


GCACCAATGA 


AGAGATGGTC 


180 


TTTCGTGGAA 


ATGTTGATAA 


CAACACACCA 


TATGCTAATT 


CTTTCACACC 


CCCAATCAAA 


240 


GCTCAGTATG 


TAAGACTCTA 


CCCCCAAATT 


TGTCGAAGGC 


ATTGTACTTT 


AAGAATGGAA 


300 


CTTCTTGGCT 


GTGAGCTC 










318 


(2) INFORMATION FOR SEQ ID NO: 20: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 




(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Cys Ser Thr Gin Leu Gly Met Glu Glv Gly Ala lie Ala Aso Ser Gin 

5 10 * 15 

lie Se r Ala ser Tyr Val Tyr Met Gly Phe Met Glv Leu Gin A-g t~ 0 

Gly Pro Glu Leu Ala Arg Leu , Tyr Arg Thr Gly He Val Asa Ala Tro 
•3a 40 45 

His Ala Ser Asn Tyr Asp Xaa Ser Lvs Pro Trp He Gin Val Asn Leu 
°° 55 60 

Leu Arg Lys Met Arg Val Ser Gly Val Met Thr Gin Gly Ala Ser Arg 

70 7 5 30 

Ala Gly Arg Ala Glu Tyr Leu Lys Thr Phe Lvs Val Ala Tv- ' ~u 

85 90 ' 
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Asp Gly Xaa Arg Lys ?he Glu Phe lie Gin Aso Glu Ser Gly (Jly Asd 

100 105 * • 110^ 

Lys. Glu Phe Leu Gly Asn Leu Asd Asn Asn Ser Leu Lvs Val Asn Met 
115' 120 125 

Phe Asn Pro Thr Leu Glu Ala Gin Tyr He Arg Leu Tyr *>ro Val Ser 
130 135 140 

Cys His Arg Gly Cys Thr Leu Arg Phe Glu Leu Leu Glv Cys Glu Leu 
145 150 155 ' 160 

Kis Gly Cys Leu Glu. Pro Leu Gly Leu Lys Asn Asn Thr He Pro Asp 

165 170 175 

Ser Gin Met Ser Ala Ser Ser Ser Tyr Lys Thr TrD Asn Leu Arg Ala 

180 185 " 190 

Phe Gly Trp Tyr Pro His Leu Gly Arg Leu Aso Asn Gin Gly Lys He 
195 200 ' 205 

Asn Ala Trp Thr Ala Gin Ser Asn Ser Ala Lys Glu Tro Leu Gin Val 
210 215 220 

Asp Leu Gly Thr Gin Arg Gin Val Thr Gly He He Thr Gin Gly Ala 
225 230 235 240 

Arg Asp Phe Gly His He Gin Tyr Val Glu Ser Tyr Lvs Val Ala His 

245 250 * 255 

Ser Asp Asp Gly Val Gin Trp Thr Val Tvr Xaa Xaa Glu Glu Gin Glv 

260 265 * 270 

Ser Ser Lys Val Phe Gin Gly Asn Leu Aso Asn Asn Ser His Lvs Lys 
275 2S0 " 235 * 

Asn He Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arc Val Leu <>ro 
290 295 " 300 

Val Ser Trp His Asn Arg He Thr Leu Arg Leu Glu Leu Leu Glv Cvs 
305 310 315 320 

(2) INFORMATION FOR SEQ ID NO; 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 321 amino acids 
(3) TYPE: amino acid 

(C) STRANDSDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

"Cys Ser Gly Pro Leu Gly He Glu Gly Gly He He Ser Asn Gin Gin 
1 5 10 # is 

He Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gin Lys Tro 

20 2o 30 
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Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu lie AsrtTAla Tr? 
35 40 ^ 45 _ z 

Thr. Ala Ala Glu Asn Asp Arg Trp Asn Arg Tro lie Gin He Asn Leu 
50 55 * 60 

Gin Arg Lys Met Arg Val Thr Gly Val He Thr Gin Gly Ala Lys Ara 
65 70 75 80 

He Gly Ser Pro Glu Tyr He Lys Phe Tyr Lys He Ala Tvr Ser Asn 

85 90 " 95 

Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asd 

100 105 J no - 

Met Val Phe Arg Gly Asn He Asp Asn Asn Thr Pro Tyr Ala Asn Se- 
115 120 125 

Phe Thr Pro Pro He Lys Ala Gin Tyr Val Arg Leu Tvr =>ro Gin Val 
i30 13 5 140 

Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cvs Glu Leu 
145 150 155 " 160 

Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His He Gin Aso 

165 170 ' 175 

Tyr Gin He Thr Ala Ser Ser He Phe Arg Thr Leu Asn Met Aso Met 

180 185 190 

Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lvs Gin Gly Lys Val 
195 200 * * 205 

Asn Ala Trp Thr Ser Gly His Asn Aso Gin Ser Gin Tro Leu Gin Va* 
210 215 220 

Xaa Leu Leu Val Pro Thr Lys Val Thr Gly He He Thr Gin Gly Ala 
225 230 235 240 

Lys Asp Xaa Gly His Val Gin Phe Val Gly Ser Tyr Lvs Leu Ala Tyr 

245 250 * 255 

Ser Asn Asp Gly Glu His Trp Thr Val Xaa Gin Aso Glu Lys Gin Axq 

260 265 ' 270 

Lys Asp Lys Val Xaa Gin Gly Asn Phe Aso Asn Aso Thr His Arq Lvs 
275 280 "285 

Asn Val He Asp Pro Pro He Tyr Ala Arg His He Arg lie Leu *ro 
290 295 300 

Trp Ser Trp Tyr Gly Arg He Thr Leu Ala Ser Glu Leu Leu Glv Cvs 
305 31 ° 315 * 320 

Thr 



INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 
(C) STRANDED NESS : 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly 

1 5 in 



5 10 

Val Pro Gin Phe Gly Lys Gly Aso lie 

20 " 25 

INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: De^tide 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly He Cvs Leu Pro Gly Leu 
1 5 10 ■ " 15 

Ala Val Gly Xaa Xaa Xaa Xaa Xaa Ser Phe Ser Cvs Glu Cys Asn 

20 25 * 30 

Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser Asd 
35 40 45 

Glu Glu Glu Pro Thr Ser Ala Giv Pro 
50 55 

INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: oeotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cvs Glu lie Se- Glu 

5 10 15 

Ala Tyr Arg Gly Asp Thr Phe He Gly Tyr Val Cys Lys Cys Pro Arg 

20 25 30 
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Gly Phe Asn Gly He His Cvs Gin His Asn He 
35 40 , 

) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Cys Glu Val Glu Pro Cys Lys Asn Glv Gly He Cvs Thr Aso Leu Val 
1 5 10 " 15 

Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Tvr Ser Cys Glu Cvs Pro Gly 

20 25 * 30 

Glu Phe Met Gly Arg Asn Cys Gin Tyr Lvs 
35 ' 40 

) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Cys Xaa Xaa Xaa Pro Cys Xaa Asn Gly Gly Xaa Cys Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tvr Xaa Cvs Xaa Cvs Xaa Xaa 

20 25 36 

Gly Tyr Xaa Gly Xaa Xaa Cys Xaa 
3 5 40 

) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 
( 3 ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
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(A) NAME /KEY: CDS 

(3) LOCATION: 1..309 y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
NGTGATATTT GTGATCCCAA TCCATGTGAA AATGGAGGTA TCTGTTTGCC AGGATTGGCT 60 
GTAGGTTCCT TTTCCTGTGA GTGTCCAGAT GGCTTCACAG ACCCCAACTG TTCTAGTGTT 120 
GTGGAGGTTG GTCCCTGCAC TCCTAATCCA TGCCATAATG GAGGAACCTG TGAAATAAGT 180 
GAAGCATACC GAGGGGATAC ATTCATAGGC TATGTTTGTA AATGTCCCCG AGGATTTAAT 240 
GGGATTCACT GTCAGCACAA CATAAATGAA TGCGAAGTTG AGCCTTGCAA AAATGGTGGA 300 
ATATGTACAG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 
(3) TYPE: nucleic acicf 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
(3) LOCATION: 549.. 1211 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



GAATTCCGGG 


AGGGAGGGTA 


GGGGGGCGGG 


CCGCGGGGGG 


CCAAAGCCAG 


CTAGGCTCAG 


60 


TCTCACACGC 


GCGCCGCCAC 


TGTTTGTATA 


TAGTGCGCTC 


CTGGCCTCAG 


GCTCGCTCCC 


120 


CTCCAGCTCT 


CGCTTCATTG 


TTCTCCAAGT 


CAGAAGCCCC 


CGCATCCGCC 


GCGCAGCAGC 


180 


GTGAGCCGTA 


GTCACTGCTG 


GCCGCTTCGC 


CTGCGTGCGC 


GCACGGAAAT 


CGGGGAGCCA 


240 


GGAACCCAAG 


GAGCCGCCGT 


CCGCCCGCTG 


TGCCTCTGCT 


AGACCACTCG 


CAGCCCCAGC 


300 


CTCTCTCAAG 


CGCACCCACC 


TCCGCGCACC 


CCAGCTCAGG 


CGAAGCTGGA 


GTGAGGGTGA 


360 


ATCACCCTTT 


CTCTAGGGCC 


ACCACTCTTT 


TATCGCCCTT 


CCCAAGATTT 


GAGAAGCGCT 


420 


GCGGGAGGAA 


AGACGTCCTC 


TTGATCTCTG 


ACAGGGCGGG 


GTTTACTGCT 


GTCCTGCAGG 


430 


CGCGCCTCGC 


CTACTGTGCC 


CTCCGCTACG 


ACCCCGGACC 


AGCCCAGGTC 


ACGTCCGTGA 


540 


GAAGGGATCA 


TGAAGCACTT 


GGTAGCAGCC 


TGGCTTTTGG 


TTGGACTCAG 


CCTCGGGGTG 


500 


CCCCAGTTCG 


GCAAAGGTGA 


CATTTGCAAC 


CCGAACCCCT 


GTGAAAATGG 


-TGGCATCTGT 


550 


CTGTCAGGAC 


TGGCTGATGA 


TTCCTTTTCC 


TGTGAGTGTC 


CAGAAGGCTT 


CGCAGGTCCG 


720 


AACTGCTCTA 


GTGTTGTGGA 


GGTTGCATCA 


GATGAAGAAA 


AGCCTACTTC 


AGCAGGTCCC 


730 


TGCATCCCTA 


ACCCATGCCA 


TAACGGAGGA 


ACCTGTGAGA 


TAAGCGAAGC 


CTATCGAGGA 


340 




GACACATTCA 


TAGGCTATGT 


TTGTAAATGT 


CCTCGGGGAT 


TTAATGGGAT 


TCACTfTTCAG 


900 


CACAATATAA 


ATGAATGTGA 


AGCTGAGCCT 


TGCAGAAATG 


GCGGAATATG 


TACCGACCTT 


950 


GTTGCTAACT 


ACTCTTGTGA 


ATGCCCAGGA 


GAATTTATGG 


GACGAAATTG 


TCAATATAAA 


1020 


TGCTCTGGGC 


ACTTGGGAAT 


CGAAGGTGGG 


ATCATATCTA 


ATCAGCAAAT 


CACAGCTTCA 


1080 


TCTAATCACC 


GAGCTCTTTT 


TGGACTCCAG 


AAGTGGTATC 


CCTACTATGC 


TAGACTTAAT 


1140 


AAGAAGGGCC 


TTATAAATGC 


CTGGACAGCT 


GCTGAAAATG 


. ACAGATGGCC 


ATGGATTCAG 


1200 


GTAACAGTGG 


GATGAGACAA 


ATCCATTTCC 


CAAATTATCA 


GAATCATTAT 


AGAAGTAGGT 


1250 


TAGGGAGAAT 


TGGCTGTGAT 


TCTTTCTCAT 


GGTTAAAATG 


TGATTTAGTT 


CAGAATTAAC 


1320 


ATGGTTGGAA 


ACTCTAAAAA 


ATGTGGAAAA 


CAGGAACATT 


CTATGTCTGA 


AAAT CTG AAA . 


1380 


AT AG CAT C AA 


GATGAAAACA 


TTCTTTAGTC 


ATAAATATAC 


TCTTTTAAGT 


TATAGTAGAG 


1440 


AAAAAG AT C T 


TATCATTTCA 


TAAGTGGACT 


TTTGGGATAG 


CATTGGAAAT 


GTAAATGAAA 


1500 


TAAATACCTA 


ATT G AAAAAA 


GTTTATTCTA 


AAGTGTTAAT 


ATTTAGCAAC 


AGATTCAGAG 


1550 


ACAAGAAAGT 


AACAATTCAA 


TCTGTGTATT 


TTTTGTGAGA 


AATAGTTTCC 


CATGTGCAAA 


1520 


TATAAAGTGC 


GCATCATATC 


ATGATAATAT 


CCAACTGTCT 


GCAGAACTCC 


CTTTCATAAA 


1680 


TGAGAGAATT 


TTAATTCATA 


GTGCCTTATA 


TCCTCATCAG 


CCATCTGACT 


TTACTACAGA 


1740 


AGAAAACAAT 


GAAATGATGC 


ATTAAGTGCT 


TTGCTAGAAG 


AAACATCATA 


GCAAAGCTGA 


1800 


TAGCCCACAT 


TCTGTGCANN 


NAAGCTTCCA 


GAGCACTCGA 


G AAAAAG C AG 


AAATG AG AT G 


1850 


TTTTATGAAA 


ACCGAAAAGA 


TAATCTGATT 


TCTGTGAAAT 


ATACTTTTGA 


TCATGTGGTT 


1920 


CTTi AAGATA 


GTCACTAACA 


AGTCATTAGT 


AGCAGATACC 


AAATGGGAGA 


AAATTTCCAG 


I960 


TATACTGAGG 


GTCAAGGCAG 


TCATGCTGAA 


ACTACATGAG 


GTCAGGAAAG 


TTTTGAAATA 


2040 


AGGTGATTTT 


GGAAGGATAC 


CTTCAACTGG 


CCTAGATTTT 


CAAGAAACAG 


TGTAATCAAC 


2100 


AGCCAAACAT 


GAGAATCTAG 


CTAACAGCAT 


TTAGAAAACC 


AGAACTAAGA 


GTGTTACTGG 


2160 


GGAATTGCAT 


TTAAATCCAG 


TATGAGAGTT 


TGCAAATGCC 


GTATTCTTCT 


AAGGGGTTTG 


2220 


TGCCACATTT 


TGTTACCATG 


GAGTCCTCTG 


TAAGAACTTT 


ATTAGATAAA 


TCATCTTTAC 


2280 


ACTATAATTT 


GAATAAAAGC 


CGGAATTC 








2308 


(2) INFORMATION FOR SE 


Q ID NO:29: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 anino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 

Met Lys Arg Ser Vai Ala Val Trp Leu Leu Vai Giv Leu Ser Leu Gl 

1 5 in' . _ 
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Val Pro Gin Phe Gly Lys Glv Aso lie Cvs Asd Pro Asn ?ro«*€ys G 1 u 

20 25 \ " 30 .t' 

Asn Gly Gly lie Cys Leu Pro Gly Leu Ala Val Gly Ser Phe'Ser Cvs 
35 40 45 

Giu Cys Pro Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Giu 
50 55 60 

Val Ala Ser Asp Giu Giu Giu Pro Thr Ser Ala Glv Pro Cvs Thr Pro 
65 70 75 80 

Asn Pro Cys His Asn Gly Gly Thr Cys Giu He Ser Giu Ala Tyr Arg 

85 go 95 

Gly Asp Thr Phe He Gly Tyr Val Cvs Lvs Cys Pro Arg Gly Phe Asn 

100 105 110 

Gly He His Cys Gin His Asn He Asn Giu Cys Giu Val Giu Pro Cvs 
115 120 125 

Lys Asn Gly Gly He Cys Thr Aso Leu Val Ala Asn Tyr Ser Cys Giu 
130 135 * 140 

Cys Pro Gly Giu Phe Met Gly Arg Asn Cvs Gin Tvr Lvs Cys Ser Gly 
145 150 155 ' " 160 

Pro Leu Gly He Giu Gly Gly lie lie Ser Asn Gin Gin He Thr Ala 

165 170 175 

Ser Ser Thr His Arg Ala Leu Phe Glv Leu Gin Lvs Tro Tyr Pro Tyr 

180 185 ' " 190 

Tyr Ala Arg Leu Asn Lys Lys Glv Leu He Asn Ala Tro Thr Ala A*a 
I 95 200 205 

Giu Asn Asp Arg Trp Lys Arg Trp He Gin He Asn Leu Gin Arq Lvs 
210 215 ~ 220 

Met Arg Val Thr Gly Val He Thr Gin Glv Ala Lvs Arg He Glv Ser 
225 230 235 " - 240 

Pro Giu Tyr He Lys Phe Tyr Lys He Ala Tyr Ser Asn Asp Gly Lvs 

245 250 * " 255 

Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Giu Aso Met Val Phe 

260 255 ' 270 

Arg Gly Asn He Asp Asn Asn Thr Pro Tvr Ala Asn Ser Phe Thr Pro 
275 230 ' 285 

Pro He Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin Val Cys Arg Arg 
290 295 300 

His Cys Thr Leu Arg Met Giu Leu Leu Glv Cvs Giu Leu Se^ GW Cvs 
305 310 ' 315 * 320 

Ser Giu Pro Leu Gly Met Lys Ser Giy His lie Gin Aso Tyr Gin lie 

325 330 335 

Thr Ala Ser Ser He Phe Arg Thr Leu Asn Met Aso Met Phe Thr Tro 

3^0 345 " 350 
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Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lvs Val AsnrAla Trp 

355 360 i " " 3.65 r ? 

Thr Ser Gly His Asn Asp Gin Ser Gin Tro Leu Gin Val Xaa Leu Leu 
370 375 380 

Val Pro Thr Lys Val Thr Gly He He Thr Gin Gly Ala Lys Aso Xaa 
385 390 395 " 400 

Gly His Val Gin ?he Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Aso 

405 410 " 415 

Gly Glu His Trp Thr Val Xaa Gin Aso Glu Lys Gin Arg Lys Aso Lvs 

420 425 ' 430- * " 

Val Xaa Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie 

435 440 445 

Asp Pro Pro He Tyr Ala Arg His He Arg He Leu Pro Tro Ser T-o 
450 455 460 

Tyr Gly Arg He Thr Leu Ala Ser Glu Leu Leu Gly Cvs Thr Glu Glu 
465 470 475 

Glu 
480 
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